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ABSTRACT 

Ocean  assimilation  systems  synthesize  diverse  in  situ 
and  satellite  data  streams  into  four-dimensional  state 
estimates  by  combining  the  various  observations  with 
the  model.  Assimilation  is  particularly  important  for  the 
ocean  where  subsurface  observations,  even  today,  are 
sparse  and  intermittent  compared  with  the  scales  needed 
to  represent  ocean  variability  and  where  satellites  only 
sense  the  surface. 

Developments  in  assimilation  and  in  the  observing 
system  have  advanced  our  understanding  and  prediction 
of  ocean  variations  at  mesoscale  and  climate  scales.  Use 
of  these  systems  for  assessing  the  observing  system 
helps  identify  the  strengths  of  each  observation  type. 
Results  indicate  that  the  ocean  remains  under-sampled 
and  that  further  improvements  in  the  observing  system 
are  needed. 

Prospects  for  future  advances  lie  in  improved  models 
and  better  estimates  of  error  statistics  for  both  models 
and  observations.  Future  developments  will  be 
increasingly  towards  consistent  analyses  across 
components  of  the  Earth  system.  However,  even  today 
ocean  synthesis  and  assimilation  systems  are  providing 
products  that  are  useful  for  many  applications  and 
should  be  considered  an  integral  part  of  the  global  ocean 
observing  and  information  system. 

1.  INTRODUCTION 

In  environmental  data  assimilation,  models  arc  used  to 
synthesize  diverse  in  situ  and  satellite  data  streams  into 
a  single  product  (an  analysis  or  state  estimate), 
combining  the  various  observations  with  the  model, 
each  according  to  its  strengths.  Assimilation  systems  are 
particularly  important  for  the  ocean  where  subsurface 
observations,  even  today,  arc  sparse  and  intermittent 
compared  with  the  scales  needed  to  represent  ocean 
variability  and  where  satellites  only  sense  the  surface. 
The  resulting  analysis  provides  an  integrated  view  of  the 
information  from  the  various  observations  as  well  as 
derived  estimates  of  unobserved  quantities  such  as 
currents,  property  transport,  overturning  circulations, 
etc. 

For  ocean  data  assimilation,  international  cooperation 
under  the  auspices  of  the  Global  Ocean  Data 
Assimilation  Experiment  (GODAE)  had  an  indelible 
impact  on  the  routine  generation  of  synthesis  products 
and  the  outreach  to  user  communities  [1].  GODAE  not 
only  demonstrated  the  feasibility  of  global  ocean  data 
assimilation  but  also  made  important  contributions  to 
the  establishment  of  an  effective  and  efficient 
infrastructure  for  global  operational  oceanography  that 
includes  the  required  observing  systems,  data  assembly 
and  processing  centres,  modelling  and  data  assimilation 
centres,  and  data  and  product  servers. 


Many  successful  applications  from  GODAE  systems  arc 
highlighted  in  [2]:  monitoring  and  prediction  of  marine 
pollution,  forecasts  for  safety  and  effectiveness  of 
operations  at  sea,  information  and  tactical  decision  aids 
to  assist  Naval  operations,  operaiional  use  of  upper 
ocean  heat  content  information  to  forecast  the  intensity 
of  tropical  cyclones,  and  ocean  initialization  of  coupled 
climate  forecast  systems  Several  examples  of  the  use  of 
ocean  current  forecasts  by  national  coast  guard  agencies 
for  safety  applications  at  sea,  including  drift  calculations 
for  search  and  rescue,  iceberg  drift  calculations,  ice 
cover  prediction,  and  safety  of  offshore  operations  arc 
provided  by  [3].  Their  conclusion  is  that  improved 
GODAE  Ocean  View  analyses  will  help  save  fuel, 
increase  efficiency,  and  improve  safety.  Reference  [4] 
provides  several  examples  of  the  use  of  GODAE 
operational  ocean  data  products  by  oil  spill  monitoring 
and  prediction  providers  around  the  world.  Benefits  of 
using  GODAE  products  in  oil  spill  modeling  include: 
improved  prediction  accuracy,  particularly  when  used  as 
nesting  data  for  local  high-resolution  ocean  models; 
global  coverage,  whereby  any  oil  spill  model  can,  in 
principle,  be  applied  anywhere  in  the  world  ocean;  and 
facilitating  alternative  forecasts  for  an  incident  from 
other  oil  spill  models. 

Ocean  assimilation  has  also  demonstrated  its  utility  for 
climate  applications.  In  particular,  ocean  analyses  have 
been  undertaken  for  the  initialization  of  seasonal 
forecasts  since  the  mid- 1990’s  (e.g  ,  [5],  [6],  [7]).  Now, 
several  operational  centres  around  the  world  issue 
routine  seasonal  forecasts  from  coupled  occan- 
atmosphcrc  models  initialized  from  ocean  and 
atmospheric  analyses  ([8]).  Historical  ocean  rcanalyscs 
and  the  associated  forecasts  provide  information  for 
forecast  calibration  and  skill  assessment.  These  products 
also  offer  a  means  for  monitoring  the  state  of  Earth’s 
climate,  providing  information  on  precursors  to  climate 
variations  such  as  the  build  up  of  warm  water  in  the 
western  equatorial  Pacific  prior  to  an  El  Nifto  event 
(e  g.,  [9]). 

Advances  in  assimilation  systems  for  the  ocean,  the 
organization  of  the  archive  of  historical  in  situ  data  and 
the  onset  of  the  era  of  high  quality  altimetry,  and,  most 
importantly,  the  availability  of  long  atmospheric 
reanalyscs  spurred  the  generation  of  historical  ocean 
reanalyses  for  the  study  of  climate  variability  (e.g.,  [5], 
[10],  [11],  inter  alia).  Today,  a  suite  of  global  ocean 
climate  synthesis  products  is  available,  some  extending 
for  a  duration  of  50  years,  others  focusing  on  the  data 
nch  period  of  the  ocean  satellite  era,  roughly  from  1993 
to  the  present.  The  number  of  studies  using  these 
products  for  oceanographic  and  climate-related  studies 
covers  a  wide  range  of  topics,  as  highlighted  in  [12]: 
ocean  circulation  studies,  sea  level  variability,  water- 
mass  pathways,  estimation  of  surface  fluxes  and  river 
runoff,  and  interannual  and  decadal  variability  of  the 


upper-ocean  and  heat  content.  They  have  also  been 
applied  to  research  in  other  disciplines  such  as 
biogeochemistry  and  geodesy. 

Increasingly,  models  and  assimilation  systems  arc  being 
used  to  provide  information  about  the  current  observing 
system  and  to  help  in  the  design  plans  for  new 
observations.  Whether  as  a  user  of  observations  or  a 
contributor  to  evaluation  of  the  observing  system,  ocean 
data  assimilation  systems  are  now  an  integral  part  of  the 
global  ocean  observing  and  information  system. 

This  paper  gives  a  brief  summary  of  synthesis  and 
assimilation  systems,  with  a  focus  on  the  observing 
system.  Section  2  provides  a  brief  overview'  of  the  state 
of  the  art  of  ocean  data  assimilation,  and  Section  3  the 
challenges  being  faced  particularly  for  ocean  syntheses 
of  the  climate  record.  Section  4  provides  a  view  of 
emerging  developments  and  the  path  forward  for  the 
ocean  data  assimilation  as  an  integral  part  of  the  global 
ocean  observing  and  information  system.  The  paper 
concludes  with  some  recommendations  for  both 
observations  and  assimilation  systems  in  Section  5. 

2.  THE  CURRENT  GLOBAL  SYNTHESIS 
EFFORTS 

An  ocean  analysis  is  obtained  by  minimizing  the  scalar 
cost  function 


^(/i(x)-y)rR  '(/i(x)-y)  +  y(. 


w  ith  respect  to  the  control  vector,  x,  which  is  usually  the 
state  vector  (temperature,  salinity,  currents,  sea  level, 
sea-ice  concentration,  tracers,  phytoplankton,  nutrient 
concentrations,  etc.)  or  related  variables.  The  vector  x^ 
represents  the  background  or  prior  estimate  of  x  from 
the  model  integration,  and  P  is  its  expected  error 
covariance.  The  vector  y  contains  the  available 
observations,  the  operator  h(x)  simulates  these 
observations  from  x,  and  R  is  the  expected  covariance 
of  the  instrument  plus  representativeness  errors 
associated  with  the  observations.  The  superscript  T 
denotes  the  transpose  operation.  The  term  Jc  represents 
constraints  that  are  applied  in  some  situations,  such  as 
the  preservation  of  model  w  ater  masses  or  a  penalty  for 
negative  nutrient  concentrations. 

A  variety  of  methods  arc  used  to  perform  ocean  state 
estimation  on  a  routine  basis,  ranging  from  relatively 
simple  and  computationally  efficient  sequential  or  filter 
methods  such  as  optimal  interpolation  (Ol),  asymptotic 
Kalman  filters,  and  the  3-dimensional  variational 
(3DVar)  method,  to  sophisticated  and  computationally 
intensive  methods,  including  ensemble  methods  that 


provide  state-dependent  estimates  of  the  multivariate 
background  error  covariances,  Kalman  smoothers,  and 
the  four-dimensional  variational  (4DVar)  method. 

Reference  [13]  provides  an  overview  of  the  assimilation 
methods  used  by  many  GODAE  systems,  and  [14] 
provides  an  overview'  and  comparison  of  twelve  of  the 
systems  dev eloped/ope rated  during  GODAE.  Most  real¬ 
time  operational  oceanography  systems  have  1/10°  or 
finer  horizontal  grid  spacing,  have  a  global  capability, 
and  make  use  of  community  ocean  models  (e.g.,  the 
HYbrid  Coordinate  Ocean  Model  (HYCOM),  Version  4 
of  the  Modular  Ocean  Model  (MOM4),  or  the  Nucleus 
for  European  Modelling  of  the  Ocean  (NEMO)). 
References  [8]  and  [12]  provide  information  on  many  of 
the  systems  used  for  climate  applications.  Because  of 
their  need  to  synthesize  the  historical  data  stream, 
climate-focused  systems  tend  to  have  coarser  resolution, 
from  about  0  25°  to  about  2°,  usually  with  some 
equatorial  refinement.  Surface  fluxes,  which  drive  the 
ocean  models,  come  from  various  sources,  especially 
from  atmospheric  reanalyses. 

Many  data  types  are  routinely  synthesized  to  produce 
ocean  state  estimates  The  most  commonly  used  are  sea- 
surface  height  (SSH)  anomalies  from  altimeters  (e.g., 
TOPEX/Poscidon  and  JASON- 1,  -2  [15]),  in  situ 
temperature  (T)  profiles  from  XBTs/C  TDs  [16,  17],  the 
global  tropical  moored  buoy  array  [18],  and  Argo  [19] 
and  in  situ  salinity  (S)  profiles  from  Argo.  Sea  surface 
temperature  (SST)  estimates  (e.g.,  [20],  [21])  arc  either 
assimilated  directly  or  used  as  a  surface  boundary 
condition.  Salinity  data  from  CTDs  and  velocity  data 
from  current  meters  and  acoustic  Doppler  current 
profilers  (ADCPs)  arc  usually  used  for  validation. 
Gravity  measurements  from  the  Gravity  Recovery  and 
Climate  Experiment  (GRACE)  and  the  newly  launched 
Gravity  field  and  steady-state  Ocean  circulation 
Explorer  (GOCE)  provide  estimates  of  the  geoid  for  use 
with  altimetry  as  well  as  derived  bottom  pressure 
estimates  (e.g.,  [22],  [23]). 

Many  aspects  of  the  quality  of  the  ocean  analyses  have 
been  presented  in  papers  documenting  the  various 
systems  as  well  as  in  papers  presented  at  the  GODAE 
Final  Symposium  As  an  example  of  the  latter,  [24] 
presents  validation  and  intercomparison  studies,  focused 
on  operational  products,  conducted  through  GODAE. 
The  validation  was  approached  through  standardized 
output  and  a  set  of  detailed  diagnostic  metrics.  The 
analyses  of  the  general  ocean  circulation  w  ere  show  n  to 
be  in  agreement  with  known  patterns  Day-to-day 
representations  of  eddy  fields  varied,  but,  statistically, 
the  ocean  variability  was  similar  among  the  systems. 

On  climate  timescales,  many  assimilation  groups  in  the 
U.S.,  Europe,  and  Japan  participated  in  a  cooperative 
effort  through  the  World  Climate  Research 
Programme’s  (WCRP’s)  Climate  Variability  and 


Predictability  Project  (CLIVAR)  Global  Synthesis  and 
Observations  Panel  (GSOP)  and  GODAE  to  compare  a 
suite  of  diagnostic  quantities  derived  from  the  different 
ocean  reanalyses  and  also  from  observations.  Some 
results  are  presented  below  and  in  [25].  More  details  are 
at  http://www.elivar.org/organization/gsop/gsop  php 

Here  we  focus  on  the  interplay  between  the  analyses 
and  observations  and  how  ocean  data  assimilation  can 
inform  decisions  regarding  investments  in  the  Global 
Ocean  Observing  System  (GOOS). 

Various  studies  have  shown  that  different  data  types 
provide  complementary  information  from  the  GOOS. 
Reference  [26]  undertook  analyses  of  the  observing 
system  for  mesoseale  applications  (see  Section  4.2). 
They  infer  that  all  observation  types  are  required  for 
constraining  mesoseale  circulation  models.  An  example 
from  climate  analyses  is  that  from  the  ECCO  group 
(Estimating  the  Circulation  and  Climate  of  the  Ocean, 
see  [27])  who  incrementally  added  different  data  types 
to  an  ECCO  synthesis  for  2006.  Fig.  1  shows  the 
differences  in  root  mean  square  (RMS)  variability  of  the 
global  meridional  overturning  circulation  from  their 
study  as  different  data  sets  are  added  The  results  imply 
that  overturning  uncertainties  would  exceed  3  Sv  in  the 
absence  of  the  modem  observing  capabilities.  While  the 
specific  value  may  be  particular  to  this  experiment, 
uncertainties  of  at  least  this  magnitude  are  likely  to 
prevail  in  state  estimates  prior  to  the  emergence  of 
modern  observing  capabilities  in  the  1990’s.  The  other 
panels  show  the  impact  of  individual  data  sets.  Not 
surprisingly,  the  stronger  constraints  on  the  overturning 
circulation  eome  from  altimetry  and  the  in  situ  data 
(mostly  Argo),  while  SST  data  provides  a  weaker 
constraint. 


Even  the  most  recent  atmospheric  rcanalyses  provide 
different  estimates  of  many  aspects  of  the  atmosphere’s 
climate,  particularly  in  the  unconstrained  variables  such 
as  precipitation  and  cloud  properties.  This  is  also  the 
ease  for  the  different  ocean  state  estimates:  the 
underlying  models  and  assimilation  approaches  differ, 
the  weights  (prior  background  and  observation  error 
covariances)  given  to  observations  and  models  differ,  as 
do  the  data  selections.  The  atmospheric  forcing  used 
also  often  differs.  Of  course  one  expects  that  the  largest 
differences  between  state  estimates  would  occur  in 
times  of  fewer  observations  if  different  surface  forcing 
is  applied  (see  Section  3.3)  and  the  state  estimates  in  the 
deeper  ocean  are  still  influenced  by  the  initial  states. 

Depending  on  the  application,  a  more  reliable  estimate 
of  ocean  climate  variability  may  lie  in  multi-model 
ensemble  approaches  [25].  To  proceed  with  this,  one 
needs  detailed  error  estimates  for  the  estimated  states, 
something  that  is  difficult  to  obtain  with  most  of  the 
current  assimilation  implementations.  Nevertheless,  an 
important  step  toward  improved  estimates  of  the  time- 
varying  ocean  state  and  its  transport  properties  is  to 
understand  the  uncertainties  in  each  estimate,  or  at  least 
the  differences  between  different  analysis  products. 
Analysis  intercomparisons,  such  as  those  from  GSOP 
and  GODAE  help  identify  commonalities  (confidence) 
and  differences  (uncertainty)  in  the  products.  They,  also 
help  identify  how  or  when  observations  constrain  the 
estimation  effectively. 


Figure  /.  The  impact  of  the  modern 
GOOS  on  the  global  meridional 
overturning  circulation  (in  Sv)  from 
the  ECCO  synthesis  [27].  The  panels 
show  RMS  variability  differences 
between  ocean  state  estimates,  based 
on  bi-weekly  averaged  fields  over  the 
year  2006  Top  left'  the  impact  of 
jointly  adding  SST,  SSI l  and  in  situ 
(T,  S)  data  to  a  baseline  estimate 
consisting  only  of  hydrographic 
climatology.  Top  right  bottom  left . 
bottom  right  :  the  impact  of  adding, 
respectively,  SST,  SSI l,  and  in  situ  (T, 
S)  data  individually 


As  one  of  the  outcomes  of  the  GSOP  intercomparison, 
[25]  compares  the  variability  in  state  estimates  from 
multi-decadal  syntheses  ([11],  [28],  [29],  [30],  [31]  [32], 
[33],  [34],  [35],  [36],  [37]).  There  is  a  large  spread  in 
the  various  estimates  of  some  quantities  such  as  global 
upper  ocean  heat  and  freshwater  content  (Fig.  2),  with 
an  even  wider  spread  in  transport  estimates  (not  shown). 
The  spread  is,  to  some  extent,  due  to  different 
approaches  or  underlying  data  sets.  However,  it  is 
somewhat  surprising  that  the  spread  increases  toward 
the  end  of  the  data  record  characterized  by  the  largest 
number  of  observations.  Reference  [25]  shows  more 
detail,  with  estimates  in  the  individual  basins.  The 
agreement  in  heat  content  is  much  better  in  the  well- 
observed  Atlantic  Ocean  (Fig.  2).  The  spread  in 
freshwater  content  variability  between  all  estimates  is 
very  large,  suggesting  a  general  problem  in  determining 
the  salt  content  from  current  observations.  This  affects 
estimates  of  many  climate  indices,  including  global  sea 
level  rise. 

Historical  ocean  state  estimates  rely  on  conventional 
ocean  observations  sueh  as  XBTs.  It  was  shown 
recently  that  XBT  data  arc  contaminated  by  previously 
undetected  errors  in  the  assumed  drop  rate  [38].  Several 
new  data-only  analyses  have  been  made  of  the  ocean 
heat  content  based  upon  corrected  XBT  fall  rates  and 
other  adjustments  to  the  basic  data  These  updated 
analyses  tend  to  remove  a  lot  of  decadal  variability 
([30],  [39],  [40],  [41]).  The  estimate  from  [30]  is 
included  in  the  upper  panel  of  Fig.  2,  whereas  none  of 
the  assimilation  products  included  corrections  to  the 
data.  Continued  efforts  arc  needed  to  address 
observational  biases  and,  as  a  corollary,  continued 
efforts  arc  needed  to  improve  ocean  reanalyscs  by  using 
the  updated  observation  databases. 

Sea  level,  or  SSI  I,  is  an  important  climate  variable 
whose  variations  have  enormous  impact  on  coastal 
communities.  Fig,  3  shows  three  synthesis  estimates  of 
local  thermostcrie  and  halosteric  SSI1  changes  over  the 
period  from  1962  to  2001,  representing  the  spread  from 
the  syntheses  presented  in  [25].  The  Simple  Ocean  Data 
Assimilation  (SODA)  [11]  uses  in  situ  profiles  to 
correct  the  model’s  T/S  structure  locally  in  space  and 
time.  Altimetry  is  projected  on  synthetic  T/S  changes 
and  only  the  latter  are  used  as  constraints.  The  results 
from  the  German  partner  of  the  ECCO  efforts  (GECCO) 
[32]  represent  the  adjoint-family  of  approaches  Also 
shown  are  results  from  the  European  Centre  for 
Medium-range  Weather  Forecasts  (ECMWF)  system 
[28],  which  is  tuned  to  improve  seasonal-to-intcrannual 
(SI)  climate  forecasts. 

Estimated  trends  over  the  entire  40-ycar  period  from 
these  three  products  differ  substantially,  especially 
between  SODA  and  GECCO.  Over  large  parts  of  the 
world  ocean,  GECCO  results  suggest  that  SSH  changes 


induced  by  heat  content  changes  and  changes  in  salinity 
counterbalance  to  some  extent  A  counter  balance  is 
less  visible  in  the  SODA  analysis  Again,  these 
estimates  arc  from  analyses  that  did  not  use  corrected 
XBT  data 


Figure  2  Global  changes  in  heal  content  in  the 
upper  700  m  (10  J,  top  panel)  and  freshwater 
content  (cm,  middle  panel).  Note  that  the  Dorningues 
heat  content  curve  uses  XBT  data  adjusted  to 
compensate  for  fall-rate  equation  errors  / 30J . 
Bottom  panel .  Changes  in  heat  content  in  the  upper 
700  m  of  the  North  Atlantic.  From  / 25 ]. 


Figure  3.  Estimates  of  local  thermosteric  (upper)  and  halosteric  (lower)  SSH  trends  over  the  period  1962  -  2001 
from  three  estimates  representing  the  spread  in  results  from  [25]  The  SODA  analysis  is  the  left-hand  column,  the 
EC  MW F  analysis  is  the  centre  column,  the  GECCO  analysis  is  the  right-hand  column 


In  the  more  recent  years  when  altimeter  data  have  been 
assimilated  (1992-2001),  large-scale  trends  in  the 
thermosteric  SSH  are  in  much  better  agreement  in  all 
three  estimates  (see  [25]).  Halostene  estimates  also  tend 
to  converge  but  substantial  differences  remain,  e.g.,  in 
the  subpolar  North  Atlantic  Interestingly,  the  high- 
resolution  simulation  (no  assimilation)  by  [42]  shows 
similar  regional  patterns  for  the  trend  over  this  recent 
decade.  They  find  that  taking  salinity  variations  into 
account  in  the  computation  of  steric  sea  level  improves 
the  comparison  with  observed  regional  sea  level  trend 
by  15%  in  RMS  difference  The  simulation  indicates 
that  regional  sea  level  changes  from  1993  to  200 1  result 
more  from  changes  in  the  ocean  circulation  than  in 
atmospheric  heat  and  water  fluxes  In  contrast,  [32] 
infer  that  surface  heat  and  freshwater  llu.xcs  contribute 
significantly  (up  to  50%)  to  sea  level  trends  over  that 
period,  especially  in  the  northern  hemisphere  and  the 
Antarctic  Circumpolar  Current  region. 

Understanding  the  differences  between  the  analysis 
products  and  making  further  improvements  in 
assimilation  capabilities  may  require  a  concerted 
comparison  effort  wherein  runs  are  undertaken  with  the 
same  data  and  forcing  and  the  analysis  diagnostics  are 
expanded  to  look  at  innovations  and  the  details  of  data 
impacts.  What  is  clear  is  that  the  salinity  estimation  and 
surface  forcing  both  impact  the  inferences  made  about 
ocean  climate  variability  and  also  prediction  skill  [8] 
using  ocean  synthesis  products 

3.  CHALLENGES 

Perhaps  the  two  greatest  challenges  for  ocean  data 
synthesis  as  a  climate  data  record  are  the  historical  data 
themselves  and  the  ability  to  make  uncertainty  estimates 


for  the  synthesis  products.  With  the  former,  the  severe 
under-sampling  of  the  water  column  and  of  most 
regions  of  the  ocean  in  the  early  periods,  the  issues  of 
biascs/uneertainties  in  air-sea  fluxes  (eg,  [43],  [44]) 
and  model  biases  [45]  take  on  greater  importance. 

3.1.  The  Global  Ocean  Observing  System 

Along  with  the  view  from  [25]  and  [46],  [47]  provides 
another  view  of  estimated  trends  in  ocean  temperature 
and  issues  faced  by  the  climate  community  because  of 
the  uneven  observational  coverage  in  both  space  and 
time.  They  find  that  the  estimated  50-year  trends  over 
most  of  the  ocean  are  not  significant  at  the  90% 
confidence  level  (CL).  In  fact  at  50  m,  only  30%  of  the 
ocean  has  a  statistically  significant  trend  with  90%  CL, 
and  the  percentage  decreases  significantly  with 
increasing  depth.  They  suggest  that  upper  ocean  heat 
content  estimates  and  trends  may  be  substantially  more 
uncertain  than  has  yet  been  acknowledged  and  that 
further  exploration  of  uncertainties  is  needed. 

Some  of  the  issues  with  data  distribution  are  obvious 
from  Fig.  4,  showing  the  non  stationarity  of  the 
observing  system,  the  under  sampling  of  the  deeper 
ocean  even  in  the  last  two  decades,  and  the  decline  of 
the  in  situ  observing  system  in  the  marginal  seas  The 
apparent  horizontal  strata  reflect  the  successive 
influence  of  450-m  XBTs,  750-m  XBTs,  500~m  tropical 
moored  buoys  and  1000-m  and  2000-m  Argo  floats.  The 
importance  of  Argo  to  the  ocean  observing  system  is 
obvious  [19].  The  sampling  situation  is  markedly  worse 
for  salinity  observations  (not  shown),  and  the 
importance  of  Argo  is  magnified  (e.g  ,  Fig.  6  below). 

1  lowever,  Argo  does  not  currently  help  in  observing  the 
marginal  seas,  so  an  alternative,  such  as  gliders  [48]  or 


reviving  the  XBT  network,  is  needed  to  address  the 
observing  system  decline  there. 
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The  changing  set  of  observation  databases  poses  another 
challenge  for  the  assimilation  community.  This  is 
particularly  so  when  different  data  sets  are  used  in 
different  systems.  The  time  to  re-synthesize  data  sets 
with  the  more  sophisticated  data  assimilation  methods 
precludes  rapid  refresh  of  synthesis  products  with 
potentially  frequent  updates  in  data  corrections.  This 
issue  also  highlights  the  importance  of  attention  to  the 
organization  of  data  sets  and  the  use  of  metadata  and 
version  control  in  the  archive  of  all  data.  Reference  [49] 
proposes  that  the  community  adopt  standards  for 
describing  and  versioning  metadata,  quality  control 
(QC),  and  observational  data 


«  :MU  X*A  M-M  -JUKI  Ww 


•*.* r*>»-  r>*  VAv^tn  t  *«OTO  P****-*!  Ciu*  o*  V***fcO  A  Cwww*' 

II1 


Figure  4.  The  number  of  global  temperature 
observations  per  month  as  a  function  of  depth.  The 
data  sources  are  XBTs,  the  global  tropical  mooring 
arrays  and  Argo  floats.  The  upper  panel  is  global; 
the  lower  panel  is  restricted  to  the  Gulf  of  Mexico 
and  the  Caribbean , 

These  changes  in  the  GOOS  over  time  have  an  impact 
on  estimates  not  only  of  trends  but  also  of  decadal 
variability  even  when  the  estimates  are  made  through 
assimilation  of  the  historical  record.  The  changes  also 
impact  the  ability  to  confidently  assess  and  calibrate 
seasonal  climate  forecast  skill. 

One  source  of  differences  and/or  deficiencies  in  the 
various  ocean  synthesis  products  is  the  input  data 
stream.  Different  choices  are  made  in  data  selection,  and 
even  different  data  sources  are  used  Systematic  data 
errors  have  been  identified  and  different  approaches 
developed  for  reducing  those  biases,  particularly  in 
XBT  temperatures,  as  mentioned  above.  For  Argo,  the 
issues  concern  float  pressure  sensors  and  salinity  sensor 
drift  (c.g.,  [19]).  Systematic  errors  of  even  1  dbar  arc  a 
concern  if  trying  to  detect  slow  signals  of  global  climate 
change.  High  quality  CTD  transects  provide  the 
standard  for  assessing  data  quality  in  profiling  floats. 
Much  work  is  yet  to  be  done  in  assembling  available 
shipboard  datasets  and  analyzing  them  jointly  with  Argo 
to  identify  and  correct  systematic  errors.  Assembling  the 
best-quality  data  is  a  multi-year  endeavour  [  1 9]. 


Differences  in  QC  procedures  also  introduce  differences 
in  assimilation  products.  A  fundamental  component  of 
any  analysis  system,  data  QC  must  correctly  identify 
observations  that  are  obviously  in  error,  as  well  as  the 
more  difficult  process  of  identifying  measurements  that 
fall  within  acceptable  ranges,  but  are  nevertheless 
erroneous.  Effective  QC  requires  a  set  of  pre- 
established,  standardized  test  procedures,  with  results  of 
the  procedures  clearly  associated  with  the  data  values 
(c.g.,  [31],  [50]).  At  present,  there  arc  few  agreed-upon 
standards  for  ocean  data  QC  and  few  cases  where  the 
procedures  and  results  from  different  groups  have  been 
compared.  The  GODAE  Quality  Control 
Intereompanson  Project  [51]  is  taking  the  first  step  by 
comparing  the  outcomes  of  profile  data  QC  procedures 
from  five  oceanographic  centres.  Results  can  be  found 
at  http://www.usgodae.org/ftp/outgoing/godac  qe. 

Even  with  the  marked  improvement  in  the  global  in  situ 
data  coverage  with  Argo,  the  current  GOOS  has  a 
number  of  serious  shortcomings.  In  addition  to  the 
decline  in  the  marginal  seas  noted  above,  the  deep  ocean 
(below  2000  m)  and  iee-eovered  regions  remain  largely 
unobserved  The  need  to  observe  the  ocean  over  the  full 
water  column  is  evident  from  ocean  analyses.  Changes 
are  not  restricted  to  the  upper  ocean  (Fig  5  and  also  the 
discussion  in  [52]  and  [53]).  Abyssal  trends  are 
apparent,  especially  in  the  Southern  Ocean  region,  but 
these  would  be  missed  in  most  of  the  current  in  situ 
observations.  Repeat  observations  from  the  World 
Ocean  Circulation  Experiment  (WOCE)  and  CL1VAR 
indicate  that  some  areas  in  the  Southern  Oceans  have 
wanned  significantly  between  700  and  3000  m  [52],  as 
have  many  regions  below  3000  m  [53],  so  it  is  not  clear 
that  the  current  ocean  state  estimates  arc  constrained 
well  enough  at  depth.  In  general,  the  deep  ocean  can  be 
expected  to  grow  in  importance  with  the  time-scale  of 
interest,  so  it  is  important  for  documentation  and  study 
of  climate  trends  as  well  as  for  longer-term  climate 
prediction  that  the  observing  system  below  2000  m  is 
improved. 


W*t!  0*4<i 


Figure  5.  Vertical  integrals  of  zonal  average 
temperature  and  salinity  trends,  translated  into 
density  trends  (abscissa,  in  kg/m2 /yr)  as  a  function  of 
latitude  (ordinate),  from  the  ECCO-GODAE 
solution  v3.73  / 54 J ,  Black:  full  water  column;  dark 
blue.  0-848  m;  red:  848-1975  m;  green:  1975-2450 
m;  cyan:  2450-5450  m.  From  [27J. 

Currently,  full-depth  ocean  observations  come  from 
dedicated  hydrographic  cruises  that  can  only  sample  a 
very  small  area  of  the  deep  ocean.  In  the  future,  these 
hydrographic  cruises  must  be  augmented  by  an  array  of 
deep  floats,  moored  instruments,  gliders,  or  a 
combination  thereof  (c.g.,  [53]).  Improvements  in 
remotely  sensed  measurements  of  vertically  integrated 
ocean  mass  (or  equivalently  bottom  pressure,  see  [22]) 
such  as  from  the  proposed  GRACE-11  mission  could 
also  help  constrain  the  deep  density  field  when 
combined  with  sea  level  and  upper  ocean  information. 

To  improve  estimates  of  climate  variations,  we  also 
need  to  enhance  information  about  boundary  currents 
and  transports  through  key  regions,  and  pursue  the 
satellite-derived  sea-ice  thickness  from  CryoSat-2  and 
lCESat-2  (e.g.,  [23]).  For  the  Arctic,  [23]  makes  the 
case  for  water  temperature  and  salinity  observations 
within  the  upper  800  m  in  the  deep  parts  of  the  Arctic 
Ocean  in  addition  to  observations  for  monitoring 
riverine  freshwater  fluxes 

In  addition  to  new  observations,  important  satellite 
measurements  such  as  altimetry,  gravity  (bottom 
pressure),  SST  from  both  microwave  (all-weather)  and 
infra-red  (high  resolution)  sources,  scatterometer  winds, 
sea-ice  concentrations,  and  ocean  colour  must  be 
maintained.  Even  with  Argo,  other  elements  of  the  in 
situ  observing  system  need  to  be  maintained  and 
enhanced.  In  particular,  the  global  tropical  moored  buoy 
array  is  essential  for  high  frequency  observations  for 
short-term  climate  forecast  initialization.  The  Tropical 
Atmosphere  Ocean  and  Triangle  Trans  Ocean  buoy 
Network  (TAO/TR1TON)  arrays  provide  the  backbone 
observing  system  for  seasonal  forecasts  [8].  These  need 


to  be  maintained  because  consistency  of  the  observing 
system  over  time  is  an  issue  in  the  calculation  of  drifts 
and  calibration  of  forecast  skill.  The  planned  tropical 
moored  buoy  array  in  the  Indian  Ocean,  the  Research 
Moored  Array  for  African-Asian-Australian  Monsoon 
Analysis  and  Prediction  (RAMA),  needs  to  be 
completed,  not  only  for  the  Indian  Ocean  variability 
itself,  but  also  to  aid  forecasts  of  intraseasonal 
variations  that  play  a  role  in  the  evolution  of  El  Nifto 
events.  Skillful  seasonal  forecasts  remain  a  challenge  in 
the  tropical  Atlantic  so  the  enhancements  to  the  Pilot 
Research  Moored  Array  in  the  Atlantic  (P1RATA)  [55] 
should  be  maintained  for  sufficient  duration  for 
significant  impacts  on  seasonal  forecasts  to  be 
established.  As  with  the  TAO/TR1TON  array,  any 
velocity  data  at  the  mooring  arrays  would  be  helpful  for 
independent  evaluation  of  the  assimilation  products.  Of 
course  the  ocean  velocity  field,  at  all  depths,  is  poorly 
observed.  Any  velocity  observations  throughout  the 
water  column  would  be  helpful  not  just  for  validation 
but  also  for  input  to  high  resolution,  operational 
oceanography  applications. 

New  satellite  measurements  like  that  of  sea  surface 
salinity  will  be  available  soon  and  need  to  be  pursued  in 
the  future.  The  Soil  Moisture  and  Ocean  Salinity 
(SMOS,  launched  November  2009)  and  upcoming 
Aquarius  (2010)  missions  will  provide  the  first  quasi¬ 
synoptic  view  of  ocean  surface  salinity,  which  should  be 
beneficial  to  ocean  synthesis.  This  will  provide  an 
additional  constraint  on  the  large-scale  surface  density 
field  and  also  on  the  mixed  layer  salinity  budget. 

The  extension  of  data  assimilation  systems  inshore  (see 
below)  assumes  that  coastal  observing  systems  will  be 
developed  to  support  them.  Cost-effective  in  situ  coastal 
observing  systems  will  be  a  challenge.  Many  satellite 
observations  also  have  issues  near  shore  The  technical 
challenges  for  coastal  altimetry  are  presented  in  [56], 
along  with  a  description  of  efforts  that  are  underway  to 
improve  products  in  the  coastal  zone  and  to  develop 
cal  ib  rat  ion/ validation  systems  for  new  data  products. 
Water  vapour  and  tidal  connections  arc  particularly  crucial, 
and  their  precision  needs  to  be  increased  in  the  coastal 
zone.  Further  work  is  needed  to  improve  satellite 
technology  and  capability  in  this  environment.  CryoSat- 
2  (to  be  launched  early  2010),  the  Sentincl-3  satellites 
and  the  Surface  Water  Ocean  Topography  (SWOT) 
mission  [57],  with  ability  to  resolve  high-resolution 
features,  hold  a  lot  of  promise  for  coastal  oceanography 
and  coastal  ocean  data  assimilation  [56]. 

3.2.  Surface  F'orcing 

Long  atmospheric  reanalyses  have  been  a  critical  source 
of  forcing  for  ocean  model  simulations  and  analyses. 
These  reanalyses  face  similar  issues  to  the  ocean 
historical  analyses  -  changing  observing  systems  and 
lack  of  error  estimates  that  only  compound  the 
problem  for  climate  analyses  and  predictions*.  The 


previous  generation  rcanalyses  (c.g.,  the  National 
Centers  for  Environmental  Prediction/National  Center 
for  Atmospheric  Research  (NCEP/NCAR)  Reanalysis 
and  ERA-40)  had  major  global  imbalances  in  heat  and 
freshwater  fluxes  [43].  The  latest  generation,  the 
Modem  Era  Retrospective-analysis  for  Research  and 
Applications  (MERRA)  [58]  and  ERA-lntcnm  [59]  arc 
improvements  in  this  regard,  but  arc  relatively  short  for 
ocean  climate  analyses,  covering  only  from  1979  and 
1989,  respectively,  to  the  present.  Although  improved 
from  previous  reanalyses,  they  still  show  impacts  of  the 
changing  observing  system  on  global  water  and  energy 
balances.  For  real-time  oceanographic  analyses,  it  is  the 
real-time  Numerical  Weather  Prediction  (NWP) 
analyses  and  forecasts  that  provide  the  needed  forcing. 

The  surface  flux  algorithms  used  in  atmospheric 
rcanalyses  and  NWP  tend  to  be  different  from  those 
derived  from  air-sea  interaction  experiments,  leading  to 
significant  differences  in  derived  flux  estimates.  Hence, 
the  development  of  improved  atmospheric  datasets  to 
force  global  ocean-ice  climate  models  is  a  key  area  that 
needs  continual  attention.  The  ocean  modelling 
community  has  tended  to  tackle  this  task  itself.  Three 
examples  are:  the  forcing  data  set  prepared  for  a  global 
occan-icc  model  comparison,  the  Coordinated  Ocean- 
ice  Reference  Experiments  (CORE)  [60]  using  the 
atmospheric  forcing  dataset  (based  on  the  NCEP/NCAR 
Rcanalysis)  compiled  by  [61]  and  updated  by  [62];  an 
analogous  dataset  based  on  ECMWF  reanalysis  [63]; 
and  the  DRAKKAR  Forcing  Sets  (DFS3  and  DSF4 
[64])  based  on  ERA -40  and  tuned  to  fit  the  needs  of  the 
DRAKKAR  model  configurations  [65].  The  Objectively 
Analyzed  air-sea  Fluxes  (OAFlux  [66])  is  another 
example  of  hybrid  products  that  blend  NWP,  satellite 
and  in  situ  variables.  Reference  [44]  points  out  that 
these  products  also  suffer  from  changes  in  the  observing 
system  and,  possibly,  changes  in  the  operational  NWP 
system.  Hence,  the  problem  of  inhomogeneity  remains 
in  diagnostics  of  decadal  variability  and  trends. 
Problems  can  also  arise  with  a  mismatch  of  scales  when 
data  from  difference  sources  arc  combined  in  flux 
algorithms. 

Satellite  observations  arc  a  key  source  of  surface  forcing 
data  Scattcrometcr  missions  arc  essential  for  estimates 
of  surface  momentum  and  turbulent  heat  fluxes. 
Impro\cmcnts  needed  for  the  coming  decade  include 
improved  sampling  at  high  wind  speeds  and  under  rain 
conditions  ([44],  [67])  and  resolution  of  the  diurnal  and 
inertial  forcing.  Remote  sensing  of  latent  and  sensible 
heating  remain  a  challenge  [68]  with  considerable 
uncertainty  in  how'  to  estimate  ncar-surface  air 
temperature  and  humidity  remotely  [69].  Measurements 
of  surface  shortwave  radiation,  and  its  penetration  into 
the  upper  ocean,  are  essential  to  support  simulations  of 
interactions  between  ocean  biology  and  physics. 
Satellite-derived  rainfall  estimates  arc  essential  for 


freshwater  flux  estimates.  However,  the  uncertainty  in 
precipitation  over  the  ocean  is  still  large  ~  differences 
between  satellite  estimates  and  NWP  products  can  be 
greater  than  10mm  month1  in  the  tropics  [70].  The 
planned  Global  Precipitation  Measurement  (GPM) 
mission  should  help  reduce  precipitation  uncertainties  in 
the  future. 

An  additional  challenge  is  the  estimation  of  fluxes 
through  sea  ice,  where  the  ocean  surface  climate  is 
noticeably  different  from  the  open  ocean  The  difficult 
environmental  conditions  at  high  latitudes  mean  that 
flux  observations  there  arc  extremely  scarce.  Existing 
gndded  flux  products  can  differ  substantially,  by  50 
Wm  2  or  more  in  the  case  of  heat  fluxes,  and  in  many 
cases  there  is  no  clear  consensus  about  which  flux 
products  arc  most  reliable  [71]  According  to  [71], 
desired  improvements  include:  increases  in  the  accuracy 
of  scattcrometcr  winds  at  high  wind  speed  and  of  heat 
fluxes  to  achieve  0.01  Nm‘  and  10  W  m2  accuracy 
(averaged  over  several  days)  with  25  km  grid  spacing. 
Achieving  this  will  require  a  concerted  plan  to  make 
better  use  of  ships  of  opportunity  to  collect 
meteorological  data,  a  targeted  effort  to  deploy  a  few 
flux  moorings  in  high  wind  regions,  and  improved 
satellite  retrievals  of  flux  related  variables 

In  situ  surface  measurements  arc  particularly  important 
for  calibration  of  satellite-derived  fluxes  and  evaluation 
of  NWP  and  rcanalysis  flux  estimates.  Reference  [44] 
summarizes  the  various  in  situ  sources  of  fluxes.  They 
recommend  expansion  of  the  surface  flux  reference 
network  under  OccanSITES  [72],  especially  in  higher 
latitudes  and  in  areas  with  severe  weather  conditions, 
and  expanding  the  ship-based  measurement  program 
The  increased  coverage  would  help  improve  NWP 
products  as  well  as  the  estimation  of  uncertainty  in  these 
products.  The  in  situ  measurements  arc  also  critical  for 
climate  quality  calibration  of  satellite  obsen  ations. 

Constraints  on  surface  fluxes  may  come  increasingly 
from  assimilating  ocean  data  However,  the 
entanglement  of  model  boundary  layer  error  with 
forcing  error  is  strong  and  care  is  needed  in  the 
interpretation  of  such  inferred  fluxes  C  ertainly  progress 
is  needed  in  consistent  estimation  of  air-sea  fluxes  using 
constraints  from  observations  in  the  atmosphere  and  the 
ocean,  perhaps  in  coupled  models.  However,  the  need 
for  attention  from  NWP  centres  in  continuously 
improving  their  surface  analyses,  especially  through 
improved  flux  paramctcrizations,  and  updated 
rcanalyses  remains  critical. 

3.3.  Modelling  and  Assimilation  Challenges 

Other  challenges  for  ocean  data  assimilation  lie  in  the 
covariance  modelling  of  background  (or  forecast)  and 
observation  errors,  including  representation  error,  and  in 
the  ability  of  the  observations  to  constrain  the  mcsoscalc 


ocean  variability.  Of  course  improving  the  models 
themselves  is  also  a  high  priority. 

The  estimates  of  model  and  data  errors  dictate  the 
outcome  of  the  estimation  product.  Therefore,  the  ocean 
state  estimation  community  needs  to  work  closely  with 
observational ists  to  obtain  robust  estimates  of  data 
errors  (including  biases),  an  important  issue  that  is  often 
left  to  assimilation  groups.  Of  course  representation 
error  is  a  function  of  the  model,  its  resolution,  and  the 
phenomena  of  interest  in  the  analysis.  Little  attention 
has  been  paid  to  the  representation  errors  for  in  situ  data 
to  date;  some  progress  is  being  made  with  satellite  data 
which  is  much  more  amenable  to  analysis  because  of  its 
better  sampling  characteristics  (e  g.,  [73],  [74]).  The 
new'  air-sea  flux  data  set  by  [75]  includes  estimates  of 
representativeness  errors  due  to  sampling  and  includes 
the  representativeness  errors  for  subdaily  variability  in 
its  random  error  estimates. 

Close  collaborations  between  the  assimilation, 
modelling,  and  observational  communities  are  needed  to 
understand  model  errors  better,  to  assess  where 
observations  and  models  diverge,  and  develop 
methodologies  to  resolve  differences.  Reference  [45] 
discusses  origins  of  biases  and  differences  between 
model  simulations  and  between  models  and 
observations:  shortcomings  in  grid  resolution,  both 
horizontal  and  vertical;  poor  numerical  algorithms, 
subgridscale  parameterizations,  or  representation  of 
other  climate  components  such  as  the  atmosphere, 
cryosphcrc,  ocean  biogeochemistry,  and  land  runoff; 
and  possibly  other  reasons  not  yet  identified. 

Of  central  importance  to  ocean  state  estimation  is  the 
ability  to  correct  both  temperature  and  salinity  and 
maintain  dynamical  balances.  This  has  been  achieved 
through  multivariate  assimilation  schemes  using 
empirical  orthogonal  functions  ([76],  [77],  [78],  [79])  or 
temperature-salinity  covariances  from  asymptotic 
Kalman  filter/smoothers  (e.g.,  [80])  or  ensemble-based 
filters  (eg.,  [81],  [82])  that  are  well  suited  to  handle 
nonstationary  stochastic  processes  in  which  the  error 
structure  of  Hows  is  highly  anisotropic  and  time- 
varying.  These  multivariate  relationships  can  also  be 
important  to  effective  assimilation  of  altimeter  data 
(e.g,  [83]).  Reference  [81]  shows  that  with  the 
multivariate  covariances  from  an  ensemble  filter  salinity 
errors  at  the  equator  are  reduced  by  45%,  vertical 
motion  errors  bv  81%,  and  the  undercurrent  errors  bv 
50%  compared  with  uni\ariaic  assimilation  of 
temperature  only.  Reference  [46]  discusses  the 
importance  of  multivariate  corrections  to  temperature, 
salinity  and  sea  level  and  the  positive  impact  on 
seasonal  forecast  skill.  References  [83]  and  [84]  show 
that  state-dependent  multivariate  covariances  can  be 
effectively  estimated  with  coupled  breeding  approaches 
and  that  improvements  in  the  salinity  state  estimates  and 
density  stratification  have  a  positive  impact  on  forecast 


skill  The  4DVar  approaches  implicitly  include  flow- 
dependent  relationships  and  balances  but  still  require 
estimates  of  the  background  and  model  error  statistics. 
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Figure  6  Impact  of  Argo  on  the  average  salinity  in 
upper  300  m  (psu)  in  the  ECMWF  S3  analysis:  (a) 
Argo  T  and  S,  and  (h)  Argo  T  only,  after  [85]. 

A  sense  of  the  critical  nature  of  the  treatment  of  salinity 
in  the  pre-Argo  era,  and  the  importance  of  Argo,  can  be 
gained  from  Fig.  6,  where  the  ECMWF  ocean  data 
assimilation  system  (S3  [28])  has  been  used  to  assess 
the  impact  of  Argo  salinity  observations  on  the  salinity 
analysis  in  the  upper  300  m  (S300).  There  arc  many 
areas  where  the  effect  of  temperature  on  S300  is 
contrary  to  the  direct  effect  of  using  salinity  [85].  S3 
uses  the  algorithm  from  [86]  to  correct  salinity  along 
isotherms  to  maintain  important  water  mass  properties, 
a  technique  that  [87]  shows  improves  upon  conventional 
depth-level  assimilation. 

The  quantification  of  model  errors  for  the  specification 
of  forecast  error  covariances  in  3DVar  or  Kalman 
filter/smoother  approaches  is  only  one  area  that  needs 
attention.  The  identification  of  model  error  sources  is 
also  critical  to  4Dvar  estimation  Some  model  errors  arc 
attributable  to  multiple  sources.  For  example,  a  biased 
SST  estimate  in  the  equatorial  Pacific  cold  tongue  could 
be  related  to  errors  in  wind,  surface  heat  flux,  or  mixing 
parameterizations  and  advection  (also  related  to 
resolution).  Determination  of  the  appropriate  “controls” 
and  correct  attribution  of  error  sources  arc  important  to 
the  fidelity  of  the  estimation  products. 

The  uncertainty  in  the  analysis  solution  arising  from 
assumptions  made  about  the  cost  function  the 
imposition  of  balance  and  other  constraints,  error 
specifications,  the  controls  is  as  yet  an  unexplored 


area  of  investigation.  A  related  question  is  the  impact  on 
the  solution  of  requiring  that  the  model  equations  be 
satisfied  so  that  budgets  can  be  balanced.  For  some 
climate  applications,  it  has  been  argued  that  forcing  the 
balance  is  necessary.  Most  ocean  assimilation  systems 
do  not  follow  this  approach.  For  atmospheric 
assimilation  systems,  the  tendency  has  been  to  use  the 
extent  of  imbalance  as  one  measure  of  the  quality  of  the 
system.  Ocean  assimilation  systems  would  benefit  from 
being  evaluated  in  a  similar  vein.  The  impacts  of  some 
of  these  choices  will  only  be  understood  through 
controlled  intercomparisons  where  the  same  model, 
data,  QC,  and  forcings  are  used. 

Although  many  real-time  operational  oceanography 
assimilation  efforts  use  resolutions  of  about  1/10°,  most 
of  the  products  for  climate  applications  have  resolutions 
that  are  too  coarse  to  represent  mesoscalc  eddies.  Since 
eddies  afTeet  climate  through  their  interaction  with  the 
larger  seales,  it  is  imperative  that  ocean  state  estimation 
for  climate  move  towards  eddy-permitting  resolutions. 
An  important  issue  is  that  the  in  situ  data  are  not 
adequate  to  constrain  the  mesoseale  and  the  emphasis  is 
placed  on  multiple  sources  of  altimeter  data  to  capture 
eddy  variability  (e.g.,  [88]).  The  future  SWOT  mission 
[57]  is  expected  to  provide  new  insight  to  ocean 
variations  at  scales  smaller  than  the  ~ 1 00km  scale 
currently  possible. 

Reference  [I]  outlines  other  issues  related  to  resolution 
for  operational  oceanography  as  the  user  community  is 
looking  to  extend  models  inshore,  across  the  shelf,  and 
into  bays  and  estuaries.  Attention  is  needed  for  better 
methods  for  nesting  models,  or  for  variable  resolution 
and  adaptive  grids.  Depending  on  the  relevant  dynamics 
of  the  situation  (c.g.,  local  topographic  effects,  wind 
wave,  tidal  currents,  land  freshwater  input,  etc.), 
assimilation  methods  have  to  be  developed  to  constrain 
not  only  the  large-scale  (quasi-)  geostrophic  field  but 
also  fields  influenced  by  small-scale  nonlinear 
processes 

4.  THE  FUTURE 

Ocean  data  assimilation  has  matured  in  significant  ways 
over  the  last  decade.  With  the  advances  made  with 
satellite  altimetry,  Argo  and  the  global  tropical  moored 
buoy  arrays,  and  upcoming  observations  such  as  the 
completion  of  the  RAMA  array  [18],  satellite 
measurements  of  surface  salinity  [89],  and  high 
resolution  ocean  surface  topography  [57],  we  can  be 
certain  that  further  significant  progress  will  emerge  in 
the  coming  decade.  Some  exciting  advances,  like  the 
contributions  being  made  to  observing  system 


evaluation/dcsign,  the  use  of  global  ocean  models  at 
true  eddy  resolving  resolution  (1/12°  or  better)  [88],  and 
the  developments  of  assimilation  with  coupled  oeean- 
atmospherc  models,  are  already  underway. 

4.1.  Integrated  Earth  System  Analyses 

The  various  observations  of  Earth's  environment  arc 
currently  assimilated  using  techniques  that  typically 
consider  the  components  of  the  Earth's  climate  system 
separately  As  such,  the  state  of  one  component  is  not 
usually  constrained  by  the  observations  and  dynamics  of 
other  components  (except  as  external  forcing)  and  the 
resulting  state  estimates  arc  not  necessarily  consistent 
across  the  various  components  This  hampers  research 
on  the  attribution  of  the  variability  and  changes  within 
the  coupled  system  and  limits  the  skill  of  climate 
prediction.  For  the  ocean,  the  issue  is  not  just  coupling 
with  the  atmosphere,  but  also  with  sea-ice  and  oecan 
biogeoehemistry,  and  also  with  the  land  surface. 

There  arc  many  issues  to  be  addressed  in  tackling  this 
problem,  not  least  of  which  arc  the  differences  in  time 
scales  for  the  oeean  and  atmosphere,  both  processes  and 
observations,  and  the  impact  of  physical  imbalances  on 
coupled  physical-biogeochcmical  models.  Another  area 
of  concern  is  the  biases  in  coupled  models  (c.g.,  [45]) 
that  will  have  a  detrimental  impact  on  state  estimates 
during  periods  that  arc  observation-challenged.  Many 
aspects  of  ocean  circulation  simulation  arc  improved 
with  resolution  in  oecan-only  models  (eg.,  [85]), 
however  resolution  does  not  necessarily  ameliorate 
important  biases  in  coupled  models  Nevertheless,  it  is 
the  role  of  data  assimilation  to  compensate  for  errors, 
including  biases  with  the  appropriate  formulation. 

There  arc  a  few  groups  pioneering  the  new  horizon  of 
estimation  using  coupled  ocean-atmosphere  models. 
Reference  [81]  uses  an  ensemble  Kalman  filter 
implementation  and  assimilates  a  pre-existing 
atmospheric  analysis  along  with  the  ocean  observations. 
Using  4DVar,  [90]  estimates  drag  (coupling) 
coefficients,  used  in  the  calculation  of  momentum  and 
heat  fluxes  between  the  ocean  and  atmosphere,  along 
with  the  ocean  and  atmospheric  states  In  addition  to 
improved  representation  of  fluxes  (c.g..  Fig  7)  and 
climate  variations,  the  coupled  assimilation  improves 
the  forecast  of  the  1997-98  El  Nifto  Reference  [91] 
assimilates  ocean  observations  into  a  coupled  model  and 
finds  an  improvement  in  several  atmospheric  fields  over 
those  from  an  atmosphere-only  run  using  observed 
SSTs.  Coupled  estimation  efforts  arc  expected  to  evolve 
markedly  as  the  climate  community  continues  to 
improve  initialization  for  seasonal  climate  prediction 
and  embarks  on  decadal  prediction 
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Figure  7.  Annual  averages  of  surface  latent  heat  flux  (W  m  ,  positive  upward)  taken  from  (a)  the  Comprehensive 
Ocean-Atmosphere  Data  Set,  (h)  coupled 4D-Var,  (c)  NCEP2  reanalysis,  and  (d)  ERA-40  reanalysis.  From  190]. 

unravelling  the  functions  of  the  marine  ecosystem  in  a 
changing  climate,  and  contributing  to  sustainable 
management  of  marine  resources  Reference  [95] 
provides  a  GODAE  view  of  recent  progress  in 
integrating  biogeoehemistry  and  ecology  into  ocean  data 
assimilation.  They  note  the  importance  of  resolution  and 
the  need  to  consider  additional  processes,  such  as  river 
runoff,  as  the  focus  moves  to  the  continental  margin. 
Coupling  biological  models  to  physical  hindcasts  from 
assimilative  models  have  sometimes  led  to  degraded 
simulations  of  biogeoehemical  variables  (e  g.,  [96], 
[97]).  However,  [97]  showed  that  their  combined 
assimilation  of  physical  and  nutrient  data  into  a  coupled 
physical-biogcoehemical  model  has  a  positive  impact  on 
modeled  phytoplankton  patterns  in  the  North  Atlantic. 

Examples  of  remotely  sensed  ocean  colour  assimilation 
include  [98]  and  [99],  in  which  global  syntheses  of 
satellite-derived  chlorophyll-a  have  been  undertaken 
with  the  NASA  Oeean  Biogeoehemical  Model  (Eig.  8) 
Comparison  with  independent  in  situ  data  for  the  period 
1998-2004  shows  a  significant  improvement  of  the 
chlorophyll  estimate.  In  [98],  the  global  RMS  log  error 
of  chlorophyll  estimated  by  the  model  is  reduced  by  the 
assimilation  from  53%  to  13%  above  the  error  of 
SeaWiFS  Regionally,  the  assimilation  estimate  exhibits 
smaller  errors  than  SeaWiFS  data  in  several  ocean 
basins. 


Data  assimilation  is  also  important  for  estimating  sea- 
ice  concentrations,  which,  like  SST,  are  needed  as  lower 
boundary  conditions  for  the  atmosphere  as  well  as  for 
ocean  state  estimation  [92]  Simple  analysis  techniques 
have  been  used  in  weather  prediction  centres  for  some 
time.  Now,  more  advanced  techniques  (ineluding  an 
ensemble  Kalman  filter  [93]  and  a  variational 
implementation  [94])  arc  starting  to  emerge  as  ocean 
models  and  assimilation  systems  evolve  to  be  truly 
global.  As  an  example,  [92]  presents  results  from  the 
hybrid  experiment  of  [94]  where  an  ensemble  Kalman 
filter  is  used  to  estimate  parameters  for  the  background 
error  covariances  used  in  the  3DVar. 

Future  developments  in  sea-ice  assimilation  will  be  in 
two  main  areas.  The  first  is  the  development  of 
techniques  to  incorporate  as  many  operational 
observations  as  feasible.  Currently,  the  observations  that 
have  been  assimilated  are  those  from  passive 
microwave  instruments,  ice  charts  and  ice  drift.  Other 
observations,  such  as  active  radar  and  satellite-derived 
sea-ice  thickness  from  the  future  CryoSat-2  and  ICESat- 
2,  will  be  incorporated  over  the  coming  years.  The 
second  development  area  lies  in  improving  the  prior 
estimate  provided  by  coupled  atmosphere-icc-ocean 
models  that  propagate  information  from  past 
observations. 

With  the  growth  of  operational  oceanography  the 
demand  has  also  been  growing  for  data  and  information 
relevant  to  understanding  the  global  marine  ecosystem. 


Assentation  *  Chkvophyll  June  15  2001 


Figure  8.  Total  surface  chlorophyll  (mg/nr  )  for  June  15,  2001  from  (a)  a  model  simulation ,  ( b )  an 
assimilation  of  SeaWiFS  data,  and  (c)  SeaWiFS.  White  indicates  sea  ice.  The  assimilation  significantly 
improves  the  chlorophyll  estimate  of  the  free-running  model  simulation  From  [98]. 


During  the  next  decade  coupled  physical- 
biogeochemical  assimilation  ean  be  expeeted  to  mature, 
providing  new  insights  not  only  to  ocean  biological 
variations  and  the  marine  earbon  cyelc  but  also  into  the 
feedbacks  within  the  physical  climate  system.  To  make 
further  progress,  issues  to  be  addressed  include  the  need 
to  improve  oeean  model  representation  of  some  physical 
variables  such  as  upper-oeean  vertical  fluxes  that  arc 
critically  important  to  biologieal  activity.  The  lack  of 
relevant  observations  is  probably  the  most  severe 
limitation  to  expansion  of  operational  systems  to 
biogeoehemieal  and  eeosystem  applications.  Thus, 
ocean  observ  ing  systems  will  need  to  be  expanded  with, 
for  example,  intensified  deployments  of  in  situ  sensors 
for  O2  and  chlorophyll,  and  inclusion  of  new  sensors  for 
nutrients,  zooplankton,  etc.,  and  new  satellite  missions 
for  ocean  color,  lidar  systems  for  mixed-layer  depths, 
and  wide-swath  altimeters  for  coastal  sea  levels. 
Improved  methods  will  need  to  be  developed  to 
assimilate  these  new  measurements. 

The  pathway  to  a  fully  integrated  Barth  system  analysis 
will  continue  to  proceed  incrementally,  with  advances  in 
coupling  of  two  or  three  components  at  a  time. 

4.2.  Realizing  the  potential  of  analyses  and  models  in 
observing  system  design 

An  important  and  emerging  role  for  ocean  data 
assimilation  is  the  assessment  of  the  contribution  of  the 


various  components  of  the  observing  system  and 
scientific  guidance  for  improved  design  and 
implementation  of  the  ocean  observing  system  As 
ocean  models  and  assimilation  systems  have  improved 
in  quality  over  the  years,  ocean  analysis  and  forecasting 
systems  and  associated  tools,  such  as  adjoint  sensitivity 
diagnostics,  have  become  powerful  means  to  assess  the 
impaet  of  the  observing  system  for  particular 
applications,  to  identify  gaps  and  ultimately  improve  the 
effieieney/effeetiveness  of  the  observing  system 

Observing  System  Experiments  (OSHs)  where  different 
components  of  the  GOOS  arc  systematically  withheld 
ean  help  quantify  the  impact  of  each  observation  type 
on  the  quality  of  analyses  or  the  skill  of  a  forecast. 
Various  other  techniques,  including  observing  system 
simulation  experiments  (OSSEs),  adjoint-  and 
ensemble-based  approaches,  can  be  used  to  aid  the 
design  and  evaluation  of  oeean  observing  systems. 
Examples  of  various  evaluations  of  the  observing 
system  ean  be  found  in  [27],  [46]  and  [100]. 

One  recurring  result  from  different  OSEs  includes  the 
complementary  nature  of  different  observation  types 
(c.g.,  [26],  [28],  [101]).  For  example,  [26]  performed  a 
scries  of  OSEs  to  eomparc  the  relative  impact  of  Argo, 
SST  and  sea  level  anomaly  (SLA)  observations  on  an 
eddy-resolving  oeean  rcanalysis.  They  systematically 
withheld  altimeter,  Argo  and  SST  observations.  They 
found  that  satellite  SST  observations  arc  the  only 
observation  type  considered  that  have  the  potential  to 


constrain  the  circulation  in  shallow  seas  and  over  wide 
continental  shelves;  altimetry  is  the  only  observation 
type  that  even  goes  close  to  constraining  the  mcsoscale 
ocean  circulation;  and  Argo  observations  are  the  only 
observation  type  that  constrains  sub-surface  temperature 
and  salinity  (Fig.  9).  None  of  the  observation  types  in 
the  GOOS  was  found  to  be  redundant.  Each  different 
observation  type  brings  unique  contributions  to  the 
GOOS  and  all  observation  types  should  be  routinely 
assimilated  by  forecast  and  rcanalysis  products  and, 
most  importantly,  maintained  by  the  international 
community.  The  consistent  use  of  and  impaet  from  SST 
observations  is  a  eredit  to  the  GODAE  High  Resolution 
SST  (GHRSST)  program  that  provides  high  level 
quality  controlled  SST  data  in  near  real-time  [21]. 


Figure  9.  Preliminary  estimates  of  the  Information 
Content  (IC;  %).  degrees  of freedom  of  signal  (DFS) 
and  the  number  of  assimilated  super-observations  (# 
Obs)  for  the  Bluelink  reanalvsis  system  in  the  region 
90-1 8(fE,  6(f}S-equator,  computed  for  l  January > 
2006.  The  scale  for  the  1C  is  to  the  left  and  the  scale 
for  the  DFS  and  #  Obs  is  to  the  right,  from  [100]. 

Another  result  that  is  common  to  many  studies  is  the 
necessity  of  assimilation  of  altimeter  data  to  represent 
mesoscale  variability  (e.g.,  [26],  [33],  [102])  Reference 

[103]  has  shown  that  four  altimeters  arc  needed  in  real 
time  to  get  similar  quality  performance  as  two 
altimeters  in  delayed  time 

A  scries  of  OSEs  using  the  1/9°  UK  Met  Office  system 
quantify  the  analysis  improvements  according  to  the 
number  of  altimeters  used  in  the  assimilation  (Fig.  10). 
The  impact  of  different  numbers  of  altimeters  was 
assessed  by  comparing  the  analyzed  SLA  with  the 
assimilated  along-track  altimeter  data,  and  the  analyzed 
surface  velocities  with  those  derived  from  surface 
drifting  buoys  (which  were  not  assimilated).  The 
addition  of  the  first  altimeter  has  the  greatest  impact. 
The  results  arc  different  for  different  regions. 
Mcsoscale  dynamics  in  the  Northeast  Atlantic  seem  to 
be  constrained  better  by  the  altimeters  than  in  the 
Northwest  Atlantic. 

The  potential  impact  of  remotely  sensed  sea  surface 
salinity  (SSS)  from  SMOS  or  Aquarius  on  the  forecast 


skill  of  the  Mercator  Ocean  system  has  been  assessed  by 

[104]  through  a  series  of  OSSEs.  They  concluded  that 
the  level  of  observation  error  is  critical  to  the  impact  of 
this  new  observation  type,  consistent  with  the  result  of 

[105]  which  assessed  the  theoretical  impact  of  SSS 
observations. 

Other  than  for  seasonal  forecast  skill,  little  has  been 
done  to  assess  the  value  of  ocean  observing  capabilities 
with  respeet  to  large-scale  ocean  circulation  diagnostics 
and  their  fluctuations  on  climate  timescales  (i.c.  decadal 
and  beyond).  Some  preliminary  analysis  of  OSEs  can  be 
found  in  [27],  [106],  [107]  and  [108].  Many  outstanding 
questions,  such  as  the  optimal  ocean  observing  system 
for  estimating  long-term  freshwater  and  heat  transports 
within  the  global  climate  system,  remain. 

Even  in  the  atmospheric  community  where  OSSEs  have 
a  longer  history,  the  usefulness  of  OSSEs  is  not 
universally  accepted.  Certainly  care  is  needed  in  the 
interpretation,  and  the  careful  simulation  of  the 
observations  with  realistic  error  characteristics  from  a 
simulation  that  has  been  validated  to  some  extent  in 
terms  of  its  representation  of  nature  is  essential. 
Comparison  of  results  from  several  systems  to  assess 
the  robustness  of  the  results  is  also  important  This  is 
also  important  for  OSEs. 

4.3.  Monitoring  the  Ocean  and  the  Observ  ing  System 

Ocean  data  assimilation  provides  a  comprehensive  and 
powerful  approach  to  monitoring  the  global  ocean  and, 
as  such,  is  an  important  adjunct  to  the  observ  ing  system. 
For  the  future,  we  envision  a  regular  evaluation  of  the 
state  of  the  ocean  over  the  full  water  column,  providing 
information  about  important  ocean  indiecs  to  both 
science  and  application  communities  on  a  regular  basis. 

In  addition,  it  is  expected  that  the  assimilation  systems 
and  associated  tools  will  provide  routine  mechanisms 
for  evaluation  of  the  observing  system  As  a  guide  to 
tools  that  may  be  developed,  the  ocean  community  can 
look  to  the  emerging  tools  in  the  atmospheric 
assimilation  community.  These  tools  represent 
diagnostics  from  analysis  and  forecasts  systems  that  arc 
relatively  inexpensive  to  compute  1  or  example, 
analysis  sensitivity  experiments  and  adjoint  tools  can 
quantify  the  impact  of  each  individual  observation  on 
an  analysis  and  forecast  (e.g,  [109]),  albeit  through  a 
single  pre -determined  metric  The  identification  of 
persistent  problems  (eg,  negative  impacts)  with 
particular  observations  may  indicate  sensor  drifts, 
particularly  if  consistent  across  several  assimilation  and 
forecast  systems.  Reference  [110]  shows  how  the 
adjoint  tools  in  conjunction  with  OSEs  can  provide 
insight  into  the  synergy  between  different  observation 
types.  These  tools  are  now'  being  implemented  into  the 
U.S  Navy's  ocean  analysis  and  forecast  systems. 
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Figure  10  Anomaly  correlation  between  forecast  SLA  and  along-track  altimetric  SLA  from  all  satellites  (left)  and 
forecast  near-surface  velocity  and  near-surface  velocity  derived  from  drifting  buoys  (right)  The  results  are  based 
on  a  series  of  OSEs  that  assimilate  SLA  data  from  0-3  satellites,  using  the  1/9 0  North  Atlantic  FOAM  configuration 
for  the  first  3  months  of 2006. 


Ocean  syntheses  should  also  be  used  in  improving  data 
quality  by,  for  example,  providing  the  “background" 
used  in  statistical  checks.  This  feedback  loop  could  be 
important  in  ensuring  that  ocean  syntheses  provide  the 
best  possible  analyses  of  current  and  past  climate.  It 
might  also  help  refine  bias  corrections  of  observations 
such  as  those  needed  for  historical  XBTs. 

Because  the  GOOS  is  constantly  changing,  results  from 
OSEs  quickly  become  obsolescent  and  a  new  paradigm 
is  needed  to  monitor  and  evaluate  the  GOOS  [100].  The 
GODAE  OeeanView  community  is  shifting  their  efforts 
to  transition  their  OSE/OSSE  activities  towards  routine 
monitoring  of  the  GOOS.  Some  initial  steps  have  been 
taken  to  coordinate  these  activities.  Specifically, 
agreement  is  sought  on:  how  GODAE  partners  can  and 
should  move  towards  routine  monitoring  of  the  GOOS; 
how  this  can  be  coordinated  between  the  international 
groups;  and  a  staged  plan  for  moving  these  activities 
towards  routine  monitoring,  so  that  the  GODAE 
OeeanView  community  can  influence  the  ongoing 
design  and  assessment  of  the  GOOS. 

5.  RECOMMENDATIONS 

Several  recommendations  are  made  here  to  improve  the 
GOOS,  the  collaborations  between  the  assimilation  and 
observation  communities,  and  the  assimilation  systems 
themselves.  With  these  improvements  we  can  realize  the 
potential  of  assimilation  systems  to  synthesize 
observations  so  as  to  provide  information  that  is  more 
extensive  than  the  information  that  can  be  gleaned  from 
individual  observations  -  for  an  impact  on  operational 
oceanography  and  climate  analysis  and  forecasts. 

5.1.  Recommendations  on  observations 

•  Systematic,  sustained  observations  of  the  ocean  and 
its  forcing  are  critical  to  the  improvement  of  decadal 


and  longer-term  ocean  state  estimation.  At  minimum, 
the  existing  GOOS  must  be  maintained  For  the  global 
observing  system  in  support  of  real  time  and  climate 
analyses,  the  requirement  is  for  the  maintenance  of 
Argo  as  well  as  important  satellite  measurements  such 
as  altimetry,  gravity  (bottom  pressure),  microwave  and 
infrared-based  SST,  scattcromcter  winds,  precipitation, 
ocean  colour,  and  microwave-based  sca-iec 
concentration.  In  addition,  the  global  tropical  moored 
buoy  arrays  (including  RAMA  and  PI  RATA)  arc 
essential  for  their  provision  of  high  frequency 
observations  for  short-term  climate  forecast 
initialization. 

•  To  improve  future  ocean  climate  estimates,  the  in 
situ  observing  system  should  be  extended  to  include 
full-depth  Argo-type  measurements.  Measurements  in 
marginal  seas  and  ice-covered  regions  arc  important  for 
both  real-time  and  climate  analyses 

•  In  addition  to  the  sustained  observations,  the 
development  of  new  observing  systems  such  as 
measurements  of  sea-ice  thickness  from  space  and  wide 
swath  altimetry  (SWOT)  are  encouraged. 

•  Improvements  in  altimetry  for  coastal  regions,  from 
existing  measurements  and  especially  from  future  high- 
resolution  altimetry,  should  be  pursued 

•  Observations  are  needed  for  assimilation  into  ocean 
biogeoehemistry  and  ecosystem  models:  deployments  of 
in  situ  sensors  for  02  and  chlorophyll,  and  inclusion  of 
new  sensors  for  nutrients,  zooplankton,  etc.,  and 
hyperspectral  satellite  measurements  of  ocean  colour 

5.2.  Recommendations  on  surface  forcing 

•  The  surface  flux  reference  network  under 
OccanSITES,  especially  in  higher  latitudes  and  in  areas 
with  severe  weather  conditions,  should  be  expanded 


along  with  the  ship-based  measurement  program  The 
increased  coverage  would  help  improve  NWP  products 
as  well  as  the  estimation  of  uncertainty  in  these 
products.  Increased  coverage  will  also  improve  satellite 
calibration. 

•  Satellite  measurements  of  air-sea  fluxes  should 
continue  to  be  a  priority,  particularly  seatterometer 
winds  and  global  precipitation 

•  Atmospheric  rcanalysis  projects  should  continue  to 
be  updated  and  attention  should  be  placed  on  improving 
surface  flux  estimates  in  NWP  and  rcanalysis  products. 

•  Land  freshwater  input  to  the  ocean  (ice  melting, 
river  runoff,  ground  water  seepage)  needs  to  be  better 
determined 

5.3.  Recommendations  on  the  input  data  streams 

•  Standard  QC  procedures  should  be  developed. 

•  Appropriate  metadata  needs  to  be  included  with  each 
observation  so  that  its  heritage  and  the  history  of 
corrections  arc  available  to  assimilation  groups. 

5.4.  Recommendations  on  modelling  and  data 
assimilation 

•  Much  needs  to  be  done  over  the  next  decade  to 
characterize  the  uncertainties  in  each  synthesis  product 

•  To  fully  understand  the  differences  between  the 
analysis  products  and  to  make  further  improvements  in 
assimilation  capabilities,  a  concerted  comparison  effort 
may  is  needed  wherein  runs  arc  undertaken  with  the 
same  data  and  forcing  and  the  analysis  diagnostics  are 
expanded  to  look  at  innovations,  residuals  and  the 
details  of  data  impacts. 

•  Progress  needs  to  be  made  in  dynamically 
consistent  coupled  atmosphcrc/occan/sea-icc  estimation 
to  provide  both  a  consistent  view  of  Earth’s  climate 
variability  and  to  improve  the  initialization  for  coupled 
climate  predictions. 

•  Ocean  state  estimates  should  be  maintained  and 
viewed  as  an  integral  part  of  the  ocean  observing  and 
information  system. 
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